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1.  INTRODUCTION 


Linear  regression  models  are  widely  used  in  statistical  analysis  of 
experimental  and  observational  data.  Usually  the  linearity  of  the  model  is 
merely  an  assumption  and  cannot  be  taken  for  granted.  In  some  planned  experi¬ 
ments,  repeated  measurements  on  the  dependent  variable  Y  can  be  taken  while 
the  independent  variable  X  is  held  fixed.  In  such  cases  standard  analysis- 
of-variance  technique  can  be  employed  to  generate  a  test  for  linearity.  In 
many  applications,  however,  the  independent  variable  is  observed  simultane¬ 
ously  with  Y.  That  is  to  say,  X,  as  well  as  Y,  is  a  random  variable.  Under 
such  circumstance  the  usual  method  for  testing  linearity  cannot  apply. 

hr  this  paper  we  shall  study  this  problem  in  the  large-sample  context. 

We  propose  a  method  to  test  the  linearity  hypothesis  based  on  a  grouping  of 
the  data.  The  critical  value  of  test-statistic  is  determined  so  that  the 

f  ' ,  < 

test  has  a  prescribed  lever  of  significance  d  asymptotically  as  the  sample 
size  tends  to  infinity.  The  consistency  of  the  test  is  established,  and 
the  asymptotic  power  is  calculated  when  the  distance  (in  some  sense)  between 
the  true  regression  function  and  the  space  of  linear  functions  tends  to  zero 
in  some  specific  rate. 


2.  ASSUMPTIONS  OF  THE  MODEL 


In  this  section  we  shall  give  a  detailed  account  of  our  assumptions. 

We  shall  adhere  to  these  assumptions  in  the  sequel. 

A  basic  assumption  is  that  the  conditional  distribution  of  Y  given 
T  *  x  has  a  form  ( F(y  -  m(x)),  i.e.,  P(Y  <  y|7  =  x)  =  F(y  -  m(x)),  where 
F  is  a  fixed  distribution  function.  We  do  not  assume  F  to  be  known.  Let 
****  ^-n'Yn^  11C*‘  °*3servat'ions  (2»y)*  Then,  under  this 

assumption,  we  can  give  Y.  a  more  convenient  expression,  as  follows: 

Y.  =  m(][. )  +  e.. ,  i  =  1 . n  (1) 

where  e^,  e2»  ...  are  iid.  with  common  distribution  function  F,  and 
Ij,  ...,  7n,  ej,  ...,  en  are  mutually  independent.  Also,  Eej  =  0.  We 
further  assume  that 

1°.  The  moment  generating  function  of  e^  exists  in  some  neighborhood 
of  zero. 

2°.  The  variance  cr  of  ej  (whose  existence  follows  from  1°)  is  positive. 
Denote  by  u  the  probability  distribution  of  7.  We  assume  that 
3°.  E||X||2  <  »,  and  COV(I)  >  0  (j|*||  is  the  Eculidian  norm). 

4°.  v  has  no  singular  component,  and  if  u  has  an  absolute  component 
with  density  f,  then  for  sufficiently  small  a  >  0  there  exists  an  open  set 
G  ,  such  that 

d 

|  G  Mx :  f ( x )  >  a)  |  =0 

G 

where  |  A|  denotes  the  Lebesgue  measure  of  A,  and  A  is  symmetric  differ¬ 
ence.  This  condition  is  a  rather  weak  one  since  it  is  satisfied  by  such  f 
whose  discontinuity  points  all  lie  in  a  closed  set  with  Lebesgue  measure  zero. 

For  brevity,  in  the  sequel  we  shall  use  "model  (1)"  in  the  sense  that 
all  of  the  above  assumptions  are  met. 
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3.  CONSTRUCTION  OF  THE  TEST 


Choose  two  numbers  and  with  the  condition 


0  <  e1  <  e2  <  1/3.  (2) 

Some  further  restrictions  on  the  choice  of  e^,  will  136  needed,  which 

will  be  stated  in  Section  5. 

-(1/3-e  )/d  d 

Put  An  =  n  .  Decompose  R  into  a  set  J*  of  supercube  having 

the  form 

{(x(1),  ....  x^):  a.Jin  <  <  (a.+l)in,  1  «  1,  ....  d),  (3) 


ai  =  0 ,  _+l ,  +2 ,  i  =1,  . . . ,  d . 


For  J  e  J*,  use  #(J)  to  denote  the  number  of  elements  in  the  set 


J  H  {Xj,  ....  Xn>.  Write 


2/3+e, 

{J:  J  e  J*,  #(J)  >  n  1 


}  ”  {  J  1  f  J  n  J  «  .  »  ,  J  1. 
nl  nZ  nc 


Obviously , 


1/3-e, 


c  <  n 
n  — 


Further,  let 


Jni  n  {-l»  -"’-n}  {-ni^  ’  -ni^  ’  *  *  *  *  — n  i  ^ n  i  ^ } 


We  have  by  definition 


(2/3)+e, 


n.  >  n 


,  i  =  1 ,  •  •  • »  cn • 


We  shall  write  Y. ( j )  and  e  .(j)  for  Y.  and  e.  ,  when  X  - ( j )  =  X.  . 

n  i  m  k  k  — n  i  — k 

Denote  by  7  .  the  arithmetic  mean  of  Ini- (j),  j  =  1,  n. .  Similarly 


■' -vvVV.V.V.v.vvv.  v.y 
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,  »f.  .• 


RSSn  tends  to  be  large  when  the  null  hypothesis  Hq  is  not  true,  and 
this  suggests  a  test  for  Hq.  Reject  Hq  when 


RSS  /a  >  C 
n  n 


2  2 
and  accept  Hq  when  (12)  does  not  hold.  an  is  a  suitable  estimate  of  a  . 


to  be  defined  in  Section  5.  In  order  to  determine  C  in  (12),  we  must  study 


the  asymptotic  distribution  of  the  statistic  RSS  /cr  under  Hn.  This  will 

n  n  u 

be  done  in  the  following  two  sections. 


4.  ASYMPTOTIC  DISTRIBUTION  OF  THE  RESIDUAL  SUM  OF  SQUARES 
The  purpose  of  this  section  is  to  prove  the  following: 

THEOREM  1.  Let  P*  =  P*(_Xj>  X2,  *  •••'  the  conditional  distribution 

given  { _Xj ,  •  ••)•  Under  model  (1)  and  when  Hq  is  true,  there  exists 

random  variable  whose  conditional  distribution  given  (7^ ,  X?>  •  ••)  is 
2 

xc  such  that  with  probability  one  we  have 
n  a 

RSS/a2  -  5n  0,  as  n  -  (13) 

Proof:  First  we  proceed  to  show  that 


— (n)^(n)— (n)/n  — *  COV(X),  *-S-.  as  n  -  (14) 

For  this  purpose,  denote  the  atoms  of  7  by  a^,  a2>  ....  and  put  p^  =  P 
(7  =  a^  >  0,  i  =1,2,  ...  .  Given  n  >  0,  find  N  so  that  pN+1  +  pN+2  + 
...  <  n.  Find  b  >  0  small  enough,  such  that 

* 

fdx  <  n .  (15) 

■  {x=f(x)<b} 


According  to  condition  4°,  there  exist  an  open  set  G,  such  that 


GMx:  f  ( x )  >  b>  |  =  0. 


Choose  a  bounded  closed  set  Fg  c:  g,  so  that 


G-F 


fdx  <  n.  Denote  by 


the 


distance  between  Fg  and  the  boundary  of  G,  then  ej  >  0. 
By  the  inequality  of  Bennett  (1962),  we  have 


P(|#(J)/n  -  u(J)|  >_  e)  £  2exp(-  ne2/(2  +  e))  (16) 

P( I #( J) /n  -  u(J)|  >  eu(J))  <  2exp( -  cnu(J))  (17) 


where  e  >  0,  and  c  >  0  is  a  constant  depending  solely  on  e.  Define  two 


subsets  of  J* 
n 


Hj  =  {J:  J  e  J*,  a.  6  J  for  some  i  £  N} 
H9  =  { J  :  J  e  J*,  J  Fn  *4,} 


and  an  event 


2/3+e, 

En  =  (#{J)  >_  n  ,  for  every  J  e  H^U  H^} 


1/3-e, 


Since  the  number  of  J's  contained  in  H,  U  H0  is  of  the  order  0(n  c)  ar 
-1/3+Ej  1  l 

m(J)  >_  bn  for  J  e  and  n  sufficiently  large,  it  is  easy  to 

prove  by  using  (16)  that 


cn1/3 

P(En)  >  1  -  0(e  Cn  ) 


where  c  does  not  depend  on  n. 

Now  define  the  event 

-l/3+e./2 

Fn  =  (min{p(Ini):  i  «  1,  cft}  <  n  } 

We  are  going  to  prove  that  for  any  given  positive  integer  k,  it  is  true 


P(Fn)  =  0(n'k). 


For  this  purpose ,  define  Dn  =  { (x^ ) ,  ....  x^ )  :  |  |  n^  k+1^2 , 

i  =  1,  d).  By  condition  4°,  it  is  readily  shown 


Defi  ne 


There  are  at  most  n 


P( X_i  e  Dn,  i  =  1 . n)  >  1  -  0(n'K) 


-l/3+e./2 

H_  =  (J:  J  e  J*,  JflD  f  $,  p( J)  <  n  1  }. 

*3  n  n 

t  most  n^k+^d  elements  in  H^.  Using  (16),  for  n  sufficiently 


large,  we  have 


k  2/3+e, 

p(Fn)  <  Cn*k  +  £.gH  P(#(j)  >  n  l) 

k  3  -1/3+e, /2 

-  Cn  +  ^JeH  pd^(J)/n  -  u(  J)  |  >  n 

3  1  1 


<  Cn-K  ♦  n(k+1>d0(e-nl/3 


)  =  0(n~k) 


which  proves  (19) 


This,  combining  with  (19)  (choosing  k  =  2  in  (19),  shows  that  with  proba¬ 
bility  one  |n./n  -  p(In1-)|  ±  eyd^),  i  =  1,  ...,  c^ ,  for  n  sufficiently 

large.  In  this  event  we  have 
c 

4  £  e  J  |x<“)|„(l  )  <£Ij6j.|t'u)MJ)  -  e£|x<“>|. 

1=1  n 


Therefore 


and  we  have  proved  that 


lim  1_2  =  0,  a  .s . 


lim  1  if  =  mu*  u  =  1 . d»  a*s< 

n-**  i  =1  n  ~ni  u 


Similar  arguments  shows  that 


lim  l  x^¥j}  =  E(I(U)1(V)), 

i  =1 


=  1,  d,  a.s, 


Now  (14)  follows  from  (25),  (26). 

According  to  an  important  theorem  in  the  theory  of  strong  approximation 
proved  by  Komi  os  and  others  (1975),  one  can  find  a  random  variable 
2 

Zni  •  N(0,a  /n.),  Zpi ,  ....  N  independent,  and 

n 

P(  I  eni  ~Zni  ^  -  ^log  ni^2/,ni^  1  K  exp(-x(log  n^2),  1  *  l,...,cn  (27) 

where  K  >  0,  X  >  0  are  constants  not  depending  on  n,  i.  Put 

Vni  =  eni  Zni *  1  =  1»***»cn 

c  c 

^ni  '  Vni  '  fT  i^1°i Vni  •  Zni  =  Zni  "  IT  J^i^ni’  1  =  1,,,,,cn 

V(n)  =  ('Vni—Vn)’>  Z(n)  =  (Zni . Znc/ 

5n  =  Z(n)W(n)Z(n)  _  Z(n)W(n)^(n)(^(n)W(n )X(n))’^(n)W(n)Z(n)  (28) 


• . 
1 


Then  RSSn  defined  by  (19)  can  be  written  as 


RSSn  =  Cn  +  V(n)W(n)V(n)  +  2V(n)W(n)Z(n) 

'  2V(n)W(n)?(n)(I(n)W(n)I(n))’1?(n)w(n)Z(n) 

=  +  Qi  +  2Q2  -  Q3  -  2Q4.  (29) 

From  the  well-known  normal  theory  and  the  assumptions  made  on  model  (1), 

it  follows  that  the  conditional  distribution  of  £  given  (X,,  ....  1  } 

n  — x  — n 

2  2 
is  x,  a  (so  the  unconditional  distribution  of  £  is  also  x  j).  Now  we 
c  -d  n  c  -d 

n  n 

proceed  to  estimate  Q1  — Q^.  First  take  Q^.  Considering  (6),  we  see  that 
the  event 

cn 

Tn  =  ‘  ZJ  i  (1°9 

o 

has  a  probability)0(n"  ).  Hence,  with  probability  one,  we  can  assert  that 

lvn1-l  1  (^g  ni)2/ni,  i  -  1 . cp  (30) 

for  n  sufficiently  large.  Now,  in  case  (30)  is  true. 

cn  cn  cn 

Q1  =  l  ni^ni  -  E  niVni  -  E  (1<?9  ni)4/V 
1  i=l  1  ni  i=l  1  ni  i=l  1  1 

Considering  (5)  and  (6),  we  have  (again  in  case  (30)  is  tree) 

1/3-e.  ,  -2/3-e,  -l/3-2e,  , 

Qj  ^  n  1(log  n)  n  =  n  ^(log  n)  (31) 


which  proves  that 


lim  Q,  =0,  a.s 


12 


Next  come  to  Qg.  By  Schwartz  inequality, 


«1  i  VZ(n)V)Z(„)  i  «1  .^",4 


But  ^i=l  niZni  "  xc  *  Hence  ^i=l  niZni/cn  =  M1) »  as  n  00 •  From  thl's» 


*n  _  ,2 


(5),  and  (31),  we  see  that 


0,  as  n  -*•  ®. 


In  order  to  deal  with  Q^,  we  shall  in  the  following  use  the  symbol  k 
to  denote  a  quantity  depending  on  ,  . ..,  7  ,  with  the  property  that 

SpP  *4  <  °°  with  probability  one.  may  assume  different  values  in  each 

of  its  appearance.  We  also  use  the  symbol  I  to  denote  an  identity  matrix 
with  appropriate  order.  Under  these  conventions,  we  have  by  (14)  that 

(i(n)M(n)?(n)>'1  i  V'"  <35> 

for  n  sufficiently  large.  Therefore, 

iknV(n)41(£l  v44>4)V(n)' 

By  this,  and  using  again  (14),  we  have 

«3  ±  knV(n)4)W(")V(n)  ’  kA  (36> 

with  probability  one  for  n  sufficiently  large.  By  (32),  we  get 


1 im  =  0,  a .s . 
n-x»  J 


Finally  we  come  to  Q^.  By  Schwartz  inequality, 

Q4  -  Q3Z(n)W(n)^(n)(^n)W(nAn})‘1^(n)W(n)Z(n) 


(38) 


‘  JU  ‘  A1  Jl*  Jl' _4»' 


Arguing  as  in  the  case  of  Q-j,  we  have 

Z(n)^(n)— (n)(-‘ (n)^(n)— (n)^  ^(n)W(n)Z(n)  -  knZ(n)W(n)Z(n)  (39) 

with  probability  one  for  n  sufficiently  large.  From  (36),  (38),  (39),  we 
have  (again  with  probability  one  for  n  sufficiently  large; 

W(n)"(n)Z<»)-  (40) 

In  the  process  of  dealing  with  we  have  shown  that  the  right  hand  side 
of  (40)  tends  to  zero  in  probability  as  n  «.  Hence,  with  probability  one. 


Q4  — *•  0,  as  n  °°.  ( 

From  (29),  (32),  (34),  (37),  (41),  and  the  observation  made  on  £n»  we 
finally  reach  (13).  Theorem  1  is  proved. 


v.  .  . 


+*+*+**  oV/c  V.jC\ S-TX.Vjl.''  1-VL‘jiAa.U.Vi 


5.  ESTIMATION  OF  THE  VARIANCE  OF  ERROR 


t 


In  this  section  we  shall  give  an  estimate  of  the  variance  a  .  For 

’  2 

our  purpose,  we  need  an  estimator  ■'  which  tends  to  a  with  a  rate  as 

0(cn-1/2). 

We  shall  consider  separately  the  two  cases  that  1  .  d  <  3,  2  ,  d  >  3. 
For  the  first  case,  we  choose  e1 ,  satisfying  (2)  and 


e£  <  ( 9e^  +  1 ) /I 2 


(42) 


and  define 


Cn  "i 


Jn  =  J,  ni^  -  /(Nn  ~  Cn) 


(43) 


For  the  second  case,  we  choose  Ej,  e3>  such  that 


0  <  ej  <  1/3,  2(1  -  e3)/d  >  ^  -  -y. 


(44) 


-d-«3>/<*  d 

Define  qn  =  n  .  Decompose  R  into  a  set  Jn  of  supercubes  having 

the  form  (3),  but  in  (3)  is  changed  to  qn.  For  each  J  e  Jn,  write  the 

elements  in  {"X. ,  . .  .,Y  >flj  as  {"X  ,(1) . J  .(n.)}.  The  observations 

— i  — n  — n  j  — n  j  j 

of  Y  corresponding 
Finally  we  define 


of  Y  corresponding  to  these  _X 1  s  is  written  as  {YnJ(l),  •••»  Ynj^nj^* 


0n  =  ^JeJ Jj=l^YnJ(^  '  YmP  ^JeJ  ^nJ  ' 
n  n 


(45) 


wher*  vnj  =  Tj=lVnJ(J)/n.r 


THEOREM  2.  Under  model  (1)  and  suppose  that  Hq  is  true,  then  with 


probability  1 


ST(ol  •  °2)  °*  as  n 


(46) 


-  .1 


Proof.  We  shall  only  give  the  details  for  the  case  d  <3,  as  the 


case  d  >  3  can  be  dealt  with  in  an  entirely  similar  wav.  Put  bni- (j)  = 


(.X^U)  then  I  t>n  i  ( j )  J  <_  ✓3"||  B  ||  £p .  From  (18),  we  have 


I  (Yni(j)  -  V  . )2  =  I  (e„,(j)  -  e  . )' 


*  T  bni(J>  +  2  J  bni(J>(enilj>  ‘  }  - 

J=1  J=1 


Put  fT  =  N  -  Cn*  In  view  of  (5),  in  order  to  prove  (46),  it  is  enough 


to  verify  the  following  assertions  (all  with  probability  one). 


1/6-C./2  p*  Cn  ni 

1°.  n  t  /n'J^O,  where  T  «  l  l  (eMj)  -  od). 

n  n  n  1=1  j=l  n1 


VNi 


.2  /  •  \  2, 


1/6-C./2  cn 

2  •  "  JjVm'";  °- 


1  /6-e.  /2  Cn  ni 

3°.  n  1  l  l  bp.(j)/N'  0. 

i=l  j=l  ni  n 


„  p*  l/6-e,/2Cn  "i 

4.  Sn—  0,  where  Sn  =  n  ^  ^1bni(j)(eni(j')  ‘  eni)/Nn* 


Under  model  (1),  T  is  a  sum  of  N  iid.  random  variables  with  zero 
n  n  2/3+e,  ,  2/3+e, 

mean  and  finite  variance  v.  By  (6),  we  have  FT  >.  n  -  1  >  ^  n 


Hence  (with  probability  one) 


1/6-C./2  c  1/2+3e,/2  ,  -3e, 

P*( | n  1  Tn/Np|  1  e)  1  P*( |T  1  1  |n  1  )  <  -^vn  1  -  0. 

e 


This  proves  1°,  For  2°,  notice  that  E(n.en/)  =  a ,  hence 


l/6-e,/2Cn  ?  1/6-e,/2  -5e,/2 

E*(n  .^1nieni/NA)  -  n  Cn/Nn  -  2n  "  0 


k 


1 /6-e. /2  Cn  ni  ? 
1  v  r  1.2 


l/6-e,/2 


1,  .Jbnitj,/Nn  ilT  d"eH2jnn  * 
j=i  n 


-  TP-  dilBirn 


2  -  ^(2/3-2e2)  +  (l/6-£l/2) 


under  the  condition  0  <  <  1/3,  0  <  e2  <  (9e^  +  1)/12  and  d  <  3,  we 

have  -(2/3  -  2c2)/d  +  (1/6  -  e^/2)  <  0.  Therefore,  the  right  hand  side 
tends  to  zero  as  n  ». 

c  c 

Finally  we  come  to  4°.  Since  the  sum  T.0.  y._.  d  .(j)(e  .(j)  -  e  .)  can  be 

m  m  m 

c  n. 

written  in  a  form  I1=1IJ-.1dni  ( j)eni  ( j)  with  I dpi ( j)  |  <  [bn - ( j)  [ ,  it  follows  that 


1/3-e,  Cn  ni 


•  /  ^  c-i  M  1  9  9  9 

Var*(S  )  <  n  [  I  b;.(j)oV. 

n  i=l  j=l  n1  n 


It  is  easy  to  see  that  the  right  hand  side  is  less  than  a  constant  g^  which 


does  not  depend  on  X^,  ...»  Xn,  and  g^  0  as  n  +  Also,  E*(Sn)  =  0. 
Therefore , 


E*S2  =  Var*Sn  +  (E*(Sn))2 


<  g 
—  an 


0, 


from  which  4  follows,  and  Theorem  2  is  proved. 

A  more  tedious  argument  allows  us  to  prove  that 


lim  /C^"(a2  -  o2)  =0,  a.s. 

n-H» 


■ 


6.  A  LARGE  SAMPLE  DETERMINATION  OF  C  IN  TEST  (12) 


We  are  now  in  a  position  to  prove  the  following  theorem,  which  gives 
a  large  sample  determination  of  C  in  test  (12). 


THEOREM  3.  Under  model  (1)  and  suppose  that  the  null  hypothesis  HQ 
is  true,  the  distribution  of  the  statistic 

T;  4  ^RSSn/  l  -  /2(cn  -  d) 
converges  to  the  standard  normal  N(0,1)  as  n  -*■  <=°. 


Proof.  From  Theorem  1  it  follows  that  the  conditional  distribution 

of  Tn  =  ^RSSn/a2  -  *4(Cn  -  d)  given  (Xj,  X2>  ...}  tends  to  N(0,1),  with 

2  2  2 

probability  one.  Since  |on  -  o|  _<  |on  -  a  |,  we  have 


|  =  Jmsn- 

n 


aon 


(48) 


By  Theorem  1,  with  probability  one  RSSn/cn  =  0p*(l).  Therefore,  from  (48) 
and  Theorem  2,  we  have  T^  -  T*  q,  with  probability  one.  Hence  we  have 
proved  that  with  probability  one,  the  conditional  distribution  of  T*  given 
(Xj,  X2,  ...}  tends  to  N(0,1),  as  n  -►  »,  and  the  same  holds  true  for  the 
unconditional  distribution  of  T*.  Theorem  3  is  proved. 

From  Theorem  3  it  follows  that  if  we  choose 


C  =  {/2(c  -  d")  +  u  r/2 

n  ao 


in  (12),  where  u  is  defined  by 


u 

(  a 


0  rx  /2dx//?7  = 


(49) 


1  -  aQ ,  then  the  test 


is  asymptotically  similar  with  size  aQ. 


7.  CONSISTENCY  OF  THE  TEST 


Denote  by  Qn(F)  the  power  of  the  test  defined  by  (12)  and  (49),  where 
F  is  the  joint  distribution  of  (Y,Y).  The  following  theorem  shows  that 
this  test  is  consistent  over  a  wide  class  of  alternatives. 

THEOREM  4.  Suppose  that  the  following  conditions  are  satisfied: 

1°.  E||7||2  <  COV(X)  >  0,  EY2  <  «. 

2°.  The  distribution  of  J  has  no  singular  component. 

3°.  The  closure  7^  of  the  set  A  of  all  atoms  of  Y  possesses  Lebesgue 
measure  zero. 

4°.  The  closure  A2  of  all  discontinuity  points  of  m(x)  =  E(Y|Y  =  X) 
possesses  Lebesgue  measure  zero. 

5°.  P(Y  <  y|Y  =  x)  depends  only  on  y  -  m(x). 

6°.  There  exists  no  linear  function  a  +  0'x  such  that 

P(m(Y)  =  a  +  0  'X_)  =  1 . 

Then,  denoting  the  distribution  of  (Y,Y)  by  F,  we  have 

lim  Q  (F)  =  1.  (50) 

n-**> 

Proof.  First  choose  d+2  points  t^ ,  ....  td+2,  such  that  the  points 

(t{,  m(t1)),  ....  (t^+2,  m(td+2))  (51) 

do  not  lie  on  a  common  hyperplane  in  Rd+*.  If  7  possesses  no  absolute 
continuous  component,  then,  by  conditions  1°  and  6°,  we  can  choose  such 
points  tj,  ...,  td+2  from  the  atom  set  A.  If  Y  has  an  absolute  continuous 
component  with  density  f,  then,  according  to  conditions  1°,  3°,  4°,  6°,  we 
can  choose  tj . td+2  from  the  set  Rd  -  With  the  additional 
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property  that  there  exists  two  constants  >  0,  h2  >  0,  such  that  for  any 
I  =  {(x^,  ....  x^):  bi  <  x<’>  <  b-  +  h ,  i  =  1,  ....  d}  with  h  e  (0,h1), 


we  have 


fdx  >_  h2h 


For  large  n,  we  can  find  d+2  distinct  supercubes  Jnl . Jn,d+2 


j*  (see  (3)),  such  that  t,  eJ.,  i  =1,  ...»  d+2.  Denote  the  elements  of 
n  '  i  m 


{X1S  ....  In>nJni  by  Xnl(  j) .  j  =  1.  ....  V  In  the  same  way  as  we  did 


in  Section  2,  we  define  Yni(j),  Xni .  Yn1  •  Fit  a  hyperplane 


y  =  a  +  S'x 
J  n  n 


to  the  d+2  points  (Y. ,  Y  .),  i  =  1,  ....  d+2,  by  the  weighted  least  squares 

— II 1  Ml 

method  with  weights  n1#  ....  nd+2.  The  weighted  residual  sum  of  squares  is 


IV 


Rn  =  ^i=l  ni  (Yni  '  an  "  Biwii^ 


For  definiteness,  we  assume  that  the  points  t^,  ...,  td+2  are  chosen  from 


the  set  Rd  -  (A.  UAj.  In  this  case,  from  (52)  and  the  fact  that  £n  = 
-(1/3-eJd 

n  .  With  probability  one  we  have 


2/3+e, 


n.  >  h2n  V2 ,  i  =  1 . d+2 


for  n  sufficiently  large. 


Given  6,  >  0,  we  can  find  nn  such  that  for  n  >  n( 


$up{m(x):  xeJ  . )  -  inf{m(x):  xeJn^ >  <  6^/2,  i  -  1,  ...»  d+2.  (54) 


Use  again  P*  =  P*(Xj»  X2»  •••)  the  conditional  probability  distribution 

given  X j ,  .  By  (53),  (54),  it  is  easy  to  see  that 

-2/3-e 

P*dYni  '  m(ti)l  -  61}  -  6i22h2lo2n  i=1’*’*’d+2  (55) 


wsww 


ffl 


with  probability  one  for  n  sufficiently  large. 


Since  the  points  (51)  do  not  lie  simultaneously  on  any  hyperplane, 
and  since  X^.  -*  t. ,  i  =  1,  d+2,  as  n  »,  it  follows  that  there 

exists  constant  62  >  0  not  depending  on  n  and  (Y. ,Yi ) ,  i  =  1,  2,  .... 
such  that 

sup{|m(t.)  -  an  -  8^Xni I :  i  =  1,  ....  d+2}  > 

for  n  sufficiently  large.  Choose  =  6^/2  in  (54).  Then  if  the  inequal¬ 
ities  | Y  .  -  m(ti)j  <  6^  all  hold  true,  we  shall  have 


>_  6^  minin^ . 


nd+2} 


(56) 


for  n  sufficiently  large.  Summing  up  all  the  arguments  above,  we  reach 
the  following  conclusion: 


lim  P(Rn  >  n2/3)  =  1. 
n-*» 


(57) 


Obviously,  we  have  RSS  >  R  .  Hence  by  (57) 

n  —  n 

lim  P(RSSn  >  n2/3)  =  1.  (58) 

n-** 


2 

Now  consider  defined  by  (45).  The  equality  (47)  still  holds  if 

n. 

we  redefine  bni(j)  =  m(Xni(j))  -  ,  with  mni  =  Ij=1  m( Xn1  ( j) )/ni .  We 

have 


cn  ni 


J,  J.  bn1<J>/N 

1=1  J=1 


1  l  l  «"2(x  .(j))/N 
i  =1  j=l  _f11 


< 

n  - 


(59) 


It  can  be  shown  that  there  exists  constant  q  >  0,  such  that  with  probabil¬ 
ity  one  we  have 

Nn  >  qn  (60) 


for  n  sufficiently  large  (see  the  end  of  this  section).  From  (59),  (60), 
using  the  strong  law  of  large  numbers,  we  get 

cn  n. 

lira  sup  l  l  b2(j)/N  <  q_1E(m2(X))  <  q_1E(Y2)  <  »,  a.s.  (61) 

n-*=°  i  =  1  j=l 


Employing  the  method  of  handling  Sn  in  the  proof  of  Theorem  2,  we  get 

c  n. 

-2„-2  2  r  r  l2  ,  . 


cn  "i 


P*(l  I  I  bni(j)(enj(j)  -eBl)|/Nn>£) 

1-1  J-l  1=1  J=1 


c  n . 

-2„-2  2  v  v  2/ *!t 


<  e’^N’V  ^  ^(^.(j))  <  e"V2a2  ^m2(X.)  -  0,  a.s.  (62) 


Finally,  we  have 


Cn  "i 


l  Menl-(j)  -  en.)  /N 
i=l  j=l 


2 ...  P* 


niVJ'  "  cni '  '"n  °* 


(63) 


Combining  (61)-(63),  we  see  that  there  exists  constant  M  <  °°,  such  that 


lim  P(o  <  M)  =  1. 

n-x» 


(64) 


From  (58),  (64),  it  follows  that 


lim  P(RSS  /o2  >  n2/3M_1)  =  1. 


n-*» 


n  n 


But  by  (5) ,  we  have 

C  <  (/2(c  -  d)  +  u  )2  <  8c  <  8n1/3  <  M_1n2/3 


(65) 


for  n  sufficiently  large.  This  means 


lim  P(RSSn/o^  >C)=1 
n-**> 


which  is  no  other  than  (50),  and  the  theorem  is  proved. 


Proof  of  (60).  If  _x  has  at  least  one  atom,  then  clearly  (60)  holds. 

Otherwise  we  choose  a  sufficiently  large  such  that 

fdx  >  1/2,  I  =  {(x(1),  ...,  x(d)):  |x(i)j  <  a,  i  =  1,  ....  d}. 

JI 

d  1/,3“g2 

I  contains  (2a)  n  elements  of  J*.  Put 

-1  d  l/3-e2 ) 

Hn  =  {J:  JeJ*,  Jcl,  P(J)  >  4  x(2a)  an  c  } 


Evidently  we  have 


d  1/3_e2 

Using  (16),  and  noticing  that  Hn  has  at  most  (2a)  n  members,  we 

obtain 

,  .  -(1/3-eJ 

P(|#(J)/n  -  p(J)|  ^8  A(2a)  an  ,  for  at  least  one  J  e  H  ) 

,  1/3-e,  l/3+2e,  ~ 

<  (2a)an  ^2exp(-cn  ;  =  0(n  ;  (67) 


where  c  >  0  does  not  depend  on  n.  From  (67)  we  see  that,  with  probability 
one,  we  have 

1  d  -^/3-g?) 

| #( J)/n  -  p( J) |  <  8~i(2a)”an  c  ,  for  all  J  e  Hn  (68) 


for  n  sufficiently  large.  But  when  (68)  holds  we  shall  have 

1  d  -d/3-eo) 

Nn  >  l  #( J)  >  n£JeH  lp(J)  -  8-1(2a)"dn  ] 

JeH  n 

n 

n 

on  account  of  the  fact  that  >  e.  and  (66).  This  shows  (60)  holds  with 


8.  ASYMPTOTIC  POWER  OF  THE  TEST 

Suppose  that  (_X,Y^),  (_X,Y^),  ...  is  a  sequence  of  alternatives, 
where  m^(x)  =  E(Y^n^|X^  =  x)  approaches  some  linear  function  a  +  g'x  as 
the  sample  size  n  increases.  Specifically  we  assume  that 

m^(x)  =  a  +  B'x  +  mQ(x)/gn  (69) 

where  g„  -*■  «  as  n  -*•  ®.  We  are  interested  in  finding  such  q  for  which 
n  3  an 

Q „ ( F ( n ) )  tends  to  some  limit  greater  than  the  size  a  and  smaller  than 
n  o 

one  (F^  is  the  distribution  of  (If.Y^)). 

THEOREM  5.  Suppose  that  (X^.Y^),  (_X^,Y  )  are  iid.  samples  of 

U,Y^),  where  Y.  =  +  e.  ,  i  =  1,  ....  n,  m^(x)  is  defined  by 

(69)  and  X^. ,  e^ ,  i  =1,  ....  n  satisfy  the  conditions  specified  in  model 
(1).  Also,  assume  that  EMq(7)  <  <*>,  and 

lim  gn/(»/n  c"1^4)  *  h~*  e  (0,«).  (70) 

n-*>°  ' 

Define 

1  =  '  7?  f.S  E[m0®  -  “  -  8'S2-  (71) 

Then  we  have 

oo  2 

lim  Q  (F(n))  *  —  e_t  /2dt.  (72) 

n-H»  n  Z 

Proof,  a s  the  detailed  proof  is  tedious,  we  give  only  a  sketch  of 
the  main  points  of  the  proof. 

1.  Since  the  linear  hypothesis  Hg  is  not  assumed  to  be  true,  the 
residual  sum  of  squares  RSSp  (9)  cannot  be  reduced  to  (11).  Under  this 
circumstance,  instead  of  (27),  we  have  to  use  a  strong  approximation  to  Yni 


p(lYn1  ‘  Z*ni  I  -  ^log  ni^  /ni^  -  K  exP(-  *Oog  n. )  ) ,  i  *  1,  ....  cn 


where  Z^.  -  N(uni,  °2/ni)*  with 

uni  -  “  *  6'*n1  +  Jj"l  ">0(Ini(j))/(ni9n>- 

2.  Put  Z  .  =  Z*^  -  a  -  B'X^.  It  is  easy  to  verify  that  the  expres¬ 
sion  (29)  still  holds  true  with  Z  .  so  defined. 

m 

3.  We  have  to  verify  that  (32),  (34),  (37)  and  (41)  still  holds  true 

in  the  present  case.  For  ,  nothing  has  been  changed  and  (31)  is  true. 

2  2 

For  Qg.  notice  that  ZjnjW^Z^/0  obeys  a  non-central  x  distribution 
with  degree  of  freedom  cp  and  non-central  parameter 


c  /n. 
n  /  i 


5n  ' 


It  is  easily  seen  that  as  n  •+  «,  $n  =  ng~'  IEiiIqOQ  +  o ( 1 ) ] .  Since  we  have 
_2 

ngn  =  0(/cT)  in  view  of  (70),  by  (5),  we  have 

1/6-e, /2 

E(Z(n)W(n)Z(N)’  *  °<Cn>  *  °<"  >  <73> 

and  (34)  follows  from  (31)  and  (73).  and  can  be  dealt  with  in  the 
same  manner. 

4.  Given  X^,  X.p»  •••  ♦  the  conditional  distribution  of  E^/o  is 

p,  where 

c  ,<T 
n’  n 

?  P  Cn  n.  -  /  Cn  n.  \  /  Cn  n.  -  -  \ /S  n. 

an  =  n*n'  .I=1ir(^i-An)(Xni-Xn),)yiTVn1.Ir 


—  (X  .  -Xn)  ( X  .-X  ) 1  ](  7  4v  .X  . 
n  -m  -  /v-m  -n  J\i=i  2  m-m 


where 


M  .  _ 

vni  *  £j=l*0(in(J))/ni  -  V  "o  ’  E”o®' 


From  (14)  and  (70),  it  is  easily  seen 


V.IW 


r.r.f.'.f 


6n  =  ^Va^nigOO)  -  ’E[(m0(X)  -  m0)X'  ]V"iE[(mQ(X)  -  mQ)X]  +  o(l)}fi 


=  /c^{min  E(m0(X)  -  a  -  g'X)2  +  o(l)}h2. 

a,g 


(74) 


—  —  2 
Given  {Xj,  X^,  •  ••)»  the  conditional  distribution  of  ?n/o  is  the  same  as 

p 

the  sum  of  iid.  variables  (n*  +  6  /Sc )  ,  i  =1,  ....  c  ,  where  n  n 

i  nn  n  ^  y  ■  *  *  > 

are  iid.,  “  N(0,1).  The  central  limit  theorem  can  be  applied  to  this 
case,  which  gives 


V°2  -  <cn  +  *n> 
/2cn  +  4sn 


N(0,1). 


(75) 


p 

5.  Define  cn  as  in  (45).  We  have  to  verify  that  (46)  still  holds 
true  in  the  present  case.  Starting  from  (47),  in  which  bni (j)  is  redefined 
as 

uni(J>  *  m0(^ni(j))  -  Ijil,n0(2ni(J))/ni) 
it  is  easily  seen  that  the  first  and  third  term  in  the  right  hand  side  of 
(47)  can  be  handled  in  the  same  way  as  before,  and  it  suffices  to  verify 
that 

cn  n’ 

I  l  bjj-(j)/n  =  o(/c“).  (76) 

i=l  j=l  ni  n 


Considering  (46),  (75)  reduces  to 


"i 


.LUni(j)/n  = 


(77) 
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Since 


cn  "i 


c  n . 

n  1 


l  1  uni )/n  —  l  l  mnUni*J*^/n  1  £  m2(X.  )/n  -*  E(mj:(  X 
i  =1  j=l  nl  i=l  j=l  0  n1  i=l  U  1  0  “ 


-2'X)),  a.s . 


Further,  by  (5)  and  (70),  we  have 

-2  1.2-1  7—  -5/6 

9n  if1 "  ^ ; n 

for  n  sufficiently  large.  This  proves  (77). 

6.  From  (45)  and  (75),  we  have 


£  /a  -  (c  +  6  ) 
^n7  n  '  n  n7 


/2c  +  4S2 

n  n 


N(0,1) 


as  n  •»  »,  under  the  conditional  distribution  given  {_X*,  1^’  ...}.  From 
(49) ,  (74) ,  we  have 

(£( c„  -  d)2  +  u  )2/2  -  ( c  +  52) 


n  n' 


lim 

n-x» 


/2c  +  46? 


n  n 

This  leads  to  (72),  concluding  the  proof  of  the  theorem. 

The  statistical  meaning  of  the  quantity  z  is  clear,  since 
min  E[mn(X)  -  a  -  8‘Xl  measures  the  distance  of  mn(X)  to  linearity  in 

a, 8  u  -  -  u  - 

the  MSE  sense.  As  this  distance  increases,  l  decreases,  and  the  asymptotic 
power  of  the  test  (right  hand  side  of  (72))  increases. 

The  theorem  indicates  that,  roughly  speaking,  our  test  has  a  dis¬ 
crimination  power  for  those  Y,  for  which  the  distance  (in  the  sense  of 

MSE)  of  E(Y|1  =  x)  to  the  linear  function  space  is  not  smaller  than 

,  -1/2  1/4,  1/3-e9 

0(n  c  ).  Under  quite  general  conditions,  we  can  prove  that  c  Cn  , 

-1/2  1/4  -5/12-e2/4  n 

and  0(n  cn  )  =  0(n  .  In  principle,  e2  can  be  chosen  arbitrarily 

near  1/3.  So  the  order  for  possible  discrimination  can  be  made  arbitrarily 
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fee 


tv 

s 


-1/2  — 
near  0(n  '  ),  but  this  order  cannot  be  reached,  unless  _X  is  purely  atomic 

with  a  finite  number  of  atoms. 


TO 
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